This document discusses using Hadoop MapReduce on Amazon Web Services for parallel processing of large datasets in a distributed manner. It provides an example of analyzing Wikipedia page view logs stored in S3 using Hadoop Streaming to run MapReduce jobs on EC2 instances with mappers and reducers written in Perl scripts. The jobs count page views to different pages and output the results.